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Abstract—Aircraft landing time (ALT) prediction is crucial for 
air traffic management, especially for arrival aircraft sequencing 
on the runway. In this study, a trajectory image-based deep learn- 
ing method is proposed to predict ALTs for the aircraft entering 
the research airspace that covers the Terminal Maneuvering 
Area (TMA). Specifically, the trajectories of all airborne arrival 
aircraft within the temporal capture window are used to generate 
an image with the target aircraft trajectory labeled as red and 
all background aircraft trajectory labeled as blue. The trajec- 
tory images contain various information, including the aircraft 
position, speed, heading, relative distances, and arrival traffic 
flows. It enables us to use state-of-the-art deep convolution neural 
networks for ALT modeling. We also use real-time runway usage 
obtained from the trajectory data and the external information 
such as aircraft types and weather conditions as additional inputs. 
Moreover, a convolution neural network (CNN) based module is 
designed for automatic holding-related featurizing, which takes 
the trajectory images, the leading aircraft holding status, and 
their time and speed gap at the research airspace boundary as 
its inputs. Its output is further fed into the final end-to-end ALT 
prediction. The proposed ALT prediction approach is applied to 
Singapore Changi Airport (ICAO Code: WSSS) using one-month 
Automatic Dependent Surveillance-Broadcast (ADS-B) data from 
November 1 to November 30, 2022. Experimental results show 
that by integrating the holding featurization, we can reduce the 
mean absolute error (MAE) from 82.23 seconds to 43.96 seconds, 
and achieve an average accuracy of 96.1%, with 79.4% of the 
predictions errors being less than 60 seconds. 

Keywords—Air Traffic Management, Aircraft Landing Time, 
Trajectory Image, Convolution Neural Networks. 


I. INTRODUCTION 


The aircraft landing time (ALT) is the time when an 
aircraft touches down on the runway [1]. The ALT and its 
prediction are of great significance for many stakeholders and 
purposes. For example, airlines can inform passengers about 
the estimated arrival time. Airports and flight operators can 
use the ALT information to plan the support services for 
inbound flights, such as parking, fueling and loading [2]. 
Furthermore, accurate predictions of ALT are essential for 
optimizing arrival aircraft schedules [3]. The arrival manager 
(AMAN) systems depend on ALT estimation to sequence the 
runway slot efficiently [4], and to provide precise information 
(e.g., time to gain, time to lose) for air traffic controllers 
(ATCOs) to conduct speed control [5]. 

Given the importance of ALT and its prediction, various 
data-driven machine learning-based approaches have been de- 
veloped for the ALT prediction. The most commonly used data 
is the trajectory data because aircraft trajectories of airborne 


flights contain plenty of information for real-time air traffic. 
The aircraft trajectories are helpful to extract the aircraft 
location, speed, traffic context, relative spacing between air- 
craft, etc [6]. The study in [7] uses airborne trajectories in 
the extended terminal maneuver airspace of WSSS to obtain 
the latitude, longitude, altitude, speed, rate of climb, heading 
of the aircraft and the entry zone of an aircraft for ALT 
prediction. The study in [8] explores another set of features 
from the trajectories for modeling ALTs, including the calcu- 
lated arrival pressure, and the calculated sequencing pressure. 
Meanwhile, researchers in [9] obtain the traffic densities at 
different granularities from the trajectory data. Though various 
information from aircraft trajectories is extracted for the ALT 
prediction, these studies do not involve aircraft holding stages, 
which makes the ALT prediction more challenging. 

The most commonly applied machine learning methods for 
ALT prediction are the decision tree-based models [9], [10]. 
For instance, the study in [9] tested several decision tree-based 
machine learning models for the ALT predictions, including 
random forest and the gradient boosting machine(GBM). The 
performance of these kind of models depends on the features, 
which requires amounts of feature engineering work. Data 
missing may make the models lose efficacy since missing data 
is removed as in [11]. Besides, manual feature engineering- 
based methods [12] depend on the data quality and human 
knowledge of the research problem. Conversely, deep learning- 
based methods can help capture the unseen patterns of the 
data. The research in [13] proposes a deep learning-based 
method for ALT prediction, where the the time series-based 
deep learning model, LSTM, is used on the non-linear (tanh) 
combination of the raw trajectory’s latitude, longitude, altitude, 
and speed. Unlike CNNs, the time-series based model cannot 
directly capture the spatial context among aircraft in TMA, 
e.g., the distances between aircraft. 

In this study, we propose using trajectory images instead 
of raw aircraft latitude, longitude, speed, heading information 
for ALT modeling. Specifically, we define a TMA research 
airspace circle (TRC) that is centered around the airport and 
covers the TMA for the problem formulation. For the case 
study of Singapore Changi Airport, the TRC has a radius 
of 50 nautical miles (NM), which is sufficient to cover the 
TMA of the airport since its TMA is located at a distance of 
40NM from the airport. For each target aircraft that arrives at 
TRC, we generate a trajectory image with the target aircraft 
trajectory colored in red and the background aircraft trajectory 


TABLE I. ACRONYM (VARIABLE) AND CORRESPONDING DEFINITION 


Acronym (Variable) Definition Description 
TRC TMA Research Airspace Circle 50 NM to Changi Airport 
TBX TMA Extended Boundary 60 NM to Changi Airport 
TBE between TRC and TBX flying distance is around 10 NM 
Tic timestamp when an aircraft 7 arrives at TRC calculated from ADS-B data 
Tani timestamp when an aircraft 7 arrives at the runway threshold calculated from ADS-B data 
te ) timestamp of leading aircraft of j arrives at TRC L£(j) is the leading one of 7 when j arrive at TRC 
t; landing time for aircraft 7 as Tour - Tire target prediction variable 
UE mean speed of aircraft 7 within TBE in knots, calculated from ADS-B 
ue () speed of the leading aircraft of 7 within TBE in knots, calculated from ADS-B 
UE the average vuE values for all aircraft within TBE in knots, calculated from ADS-B 
T trajectory image capture window with size 7 seconds time window as [Tec —T, Tre 
6 tabular feature capture window with size 6 seconds time window as [Tee — 6, Tire] 
xe generated trajectory image for aircraft 7 in capture window 7 used to represent TMA context 
x? tabular features for aircraft 7 captured in window 6 runway usage, weather, aircraft type, etc. 


colored in blue. The trajectory images inherently contain 
the aircraft’s location, speed, heading, relative distance, entry 
zone, and arrival traffic flow in the TRC airspace. Based 
on the generated trajectory images, we design an end-to-end 
deep learning model that leverages state-of-the-art convolution 
neural networks for predicting aircraft landing times. Contri- 
butions of this study are summarized as follows: 


e Trajectory images are proposed to be generated and 
utilized in the ALT modeling. These images can capture 
the aircraft’s position, speed, heading, entry zone, arrival 
traffic flow, and relative distances among airborne flights. 
The trajectory image generation can reduce our major 
efforts in the complicated feature engineering process 
and enable us to apply state-of-the-art convolution neural 
networks for the ALT modeling problem. 

e A CNN-based module is designed for the automatic 
holding-related featurization. Except for trajectory im- 
ages, this module also takes the holding status of the lead- 
ing aircraft, the time/speed gap at TRC with the leading 
aircraft, and the speed variation at TRC as its inputs. Its 
output is further fed into the final ALT prediction to make 
it capable of capturing the holding-related features, and 
hence an end-to-end ALT prediction model is proposed. 

e The proposed ALT prediction approach has been testified 
with the case study of Singapore Changi Airport, where 
one-month ADS-B data from November 1 to November 
30, 2022 has been used. Experimental results show that 
incorporating the holding featurization module reduced 
the mean absolute error (MAE) from 82.23 seconds to 
43.96 seconds. The bad prediction ratio dropped by 1.31 
percent. Furthermore, 79.4% of the prediction errors are 
less than 60 seconds. 


The rest of this paper is organized as follows. Section II 
presents the methodology. Section III presents the experimen- 
tal evaluations and corresponding analysis of the prediction 
accuracy. Conclusions and discussion for future work are 
finally drawn in Section IV. 


Il. METHODOLOGY 
A. Problem Statement 


For clear and concise expression, acronyms, variables, and 
corresponding definition used in this study are summarized in 
Table I. In this study, we focus on predicting the landing time 
of each inbound aircraft when it arrives at TRC of Singapore 
Changi Airport. The landing time t; for an aircraft 7 is the 
time calculated as t; = Tyyp — Tigcs where Tryp and Tike 
are respectively the timestamps when the aircraft 7 arrives at 
the runway threshold and the TRC (50NM to the airport). 
Both timestamps are extracted from aircraft trajectory data. 
For predicting t;, we intend to construct a mapping function 
BHF CG). In this study, we utilize deep neural networks 
to construct the mapping function, and the input feature xr 
is a combination of the generated trajectory image X ¥ 7 within 
the capture window [ane —T, eek and the tabular features 
x? captured in the time window bee — 6, ek Hence, 


j 
the objective is to construct the mapping function 


ty =F(XP, XX"), 5 T (1) 

where xX; xX ey are respectively the trajectory image and 
tabular features for aircraft 7. In the following subsections, we 
elaborate on the methodology overview, dataset description, 
data analysis, feature engineering (including the trajectory 
image generation and the related tabular feature extraction), 
holding featurization, and deep convolution neural networks 
for the proposed approach. 


B. Methodology Overview 


The framework for the proposed aircraft landing time pre- 
diction method is described in Fig. 1. The ALT prediction 
takes the generated trajectory image, the aircraft information, 
runway threshold usage, weather condition, and the output 
from the Holding Featurization as its inputs. Historical trajec- 
tories of aircraft are used to generate the trajectory images and 
extract the runway arrivals, which are the inputs for both the 
Holding Featurization and the ALT Prediction. Additionally, 
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Figure 1. Overview of the Proposed Method. 


the leading aircraft holding status, the time gap and speed gap 
with the leading aircraft at TRC, and the speed variation at 
TRC are used as the inputs of the Holding Featurization. 


C. Dataset 


1) Trajectory Data: The trajectory data source of this 
study is Automatic Dependent Surveillance-Broadcast (ADS- 
B) flight data. One-month ADS-B data from November 1 to 
November 30, 2022, have been used in this paper. Even though 
the ADS-B dataset is of high quality, where the data sampling 
granularity is typically one positioning point per second, some 
of the data segments have slight missing points, hence a 
quadratic interpolation is used to impute the missing points for 
both latitude and longitude. By matching the imputed ADS-B 
positioning points to Changi airport runways, we extract the 
arrival aircraft data and get the corresponding landing runway 
for each aircraft. After removing the outliers (outside 3 times 
stand deviation to the mean value), we have 8396 aircraft 
for this study. Since the TRC is 50NM from the airport, the 
longitude range is (103.0, 105.0) and the latitude range is (0.5, 
2.25). Fig. 2 shows the one day’s trajectories of arrival flights 
with the distance circles. 

2) Meteorological Data: the meteorological data (METAR) 
for Singapore Changi airport station are obtained from Iowa 
Environmental Mesonet from Iowa State University [14], 
where the wind direction, wind speed in knots, visibility in 
miles, wind gust in knots, sky coverage and the altitude in 
feet are provided. 

3) Flight Plan and Online Aircraft Performance Database: 
Flight Plan (FPL) and online aircraft performance database 
(APD) [15] are used to obtain the aircraft RECAT-EU. Each 
record in the FPL contains the aircraft model type (e.g., 
A320), and the online APD provides the mapping from model 
type to the RECAT-EU. The aircraft are classified into six 


categories of RECAT-EU, namely, Light, Lower Medium, 
Upper Medium, Lower Heavy, Upper Heavy, and Supper 
Heavy. Our dataset contains five of them, as “light” aircraft 
did not land at Singapore Changi Airport in November 2022. 


D. Data Analysis 


1) Landing Time Analysis: As described in the problem 
statement section, the landing time t; for aircraft 7 is calcu- 
lated as the time gap between the timestamp for landing at 
the runway threhold 7742, and the timestamp 77x- when the 
aircraft arrives at TRC, where both timestamps are extracted 
from the ADS-B trajectory data. The landing time distribution 
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Figure 2. Arrival Aicraft Trajectories in One Day for Changi Airport. 
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Figure 3. Aircraft Landing Time vs RECAT-EU. 


for all aircraft categories during a day is shown in Fig. 3. 
It indicates that *Super Heavy” aircraft tends to have longer 
landing times, and ”*Lower Medium” aircraft tends to have 
the shorter landing times. Hence, it’s essential to involve the 
RECAT-EU of an aircraft in the ALT prediction modeling. 

2) Runway Threshold Usage for Aircraft Landing: Specific 
to the runway 02L20R, the runway threshold used during a 
day can be changed due to the runway operation (e.g., due 
to wind direction change, etc.). The runway thresholds for the 
physical runway 02L20R are 02L and 20R. The actual runway 
threshold usage for aircraft landing in two days are shown in 
Fig. 4. The figure shows that the runway threshold 02L is used 
before 12:00 on Day 1 and changed to 20R at 12:00. On Day 
2, the runway is closed during the early morning and changed 
from 02L to 20R between 12:00-13:00. The runway threshold 
changes for landing can cause extra waiting time for aircraft 
in the TMA. Accordingly, we use a runway change label (0/1) 
in the ALT modeling. 
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Figure 4. Runway Threshold Usage in Two Days. 


3) Holding Detection and Impact on ALT: An aircraft may 
have to enter a holding stage due to the air traffic control, 
which makes the landing time prediction more difficult. In 
this study, we propose to integrate the holding potentials of 
each aircraft for the aircraft landing time prediction. In this 
section, we will analyze the aircraft holdings and their effect 
on the ALTs. The holding detection is based on the ADS- 
B data [16]. The trajectories with holdings in two days are 
shown in Fig. 5. We can note that arrival flights can enter 
the TMA of Changi airport from four directions, and in each 
direction, flights may enter a holding stack to follow the air 
traffic control. The holdings mostly happen between 30NM 
and 50NM to the airport, and we set the TRC as 50NM to 
cover these holdings. 

To illustrate the effect of holdings on the ALTs, we compare 
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Figure 5. Trajectories with Holdings in Two Days. 
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Figure 6. Aircraft Landing Times vs Holdings. 


the ALTs for aircraft with and without holdings in Fig. 6. It 
shows that flights with holdings have higher ALTs than the 
flights without holdings. For this reason, we propose to utilize 
a module to capture the holding probability-related features 
for the ALT prediction. The following subsection presents the 
input features for the proposed approach. 


E. Feature Engineering 


1) Factors and Representation: As we analyzed in the 
previous subsection, various factors can affect t;, such as 
the TMA traffic context, the weather conditions, the runway 
settings and operations, as well as the control intent for 
safety guarantees. In this study, we explore how to integrate 
these factors for predicting the aircraft landing time t;. We 
summarize the potential infecting factors in Table H. Three 
categories of features are used: the TMA context, the runway 
operation, and the external information. The TMA context 
is represented as the generated trajectory image, which is 
generated from the ADS-B data to capture the aircraft position, 
speed, heading, entry zone, and the TMA arrival traffic. We 
will describe each type of features in the follow subsections. 

2) Trajectory Image Generation: In this study, we generate 
the trajectory image for representing the TMA context. Given 
the capture window size T, we generate the trajectory image 
for an aircraft 7 when it arrives at TRC. The track position 
points for all aircraft inside TRC and within the time window 
[Tizc —T> Trpcl are collected to plot the image. Specifically, 
the trajectory of the target aircraft 7 are marked in red, and 
all other planes’ trajectories are marked in blue. 

Fig. 7 shows examples of the generated images for aircraft 
landing from four different directions to Singapore Changi 
Airport. The image generation is based on the ADS-B data, 


TABLE II. FEATURES FOR ALTS AND THEIR REPRESENTATIONS. 


Indicator Description Representation Feature Category Data Source 
aircraft position pixel positions in the image 
aircraft speed trajectory length in the image 
aircraft heading tail to head direction trajectory image TMA Context ADS-B 
entry zone pixel positions of the target aircraft 
arrival traffic number of trajectories in the image 
dynamic runway setting runway threshold change label O/T Runway Operation 
runway usage arrivals on each runway real number 
Weather dret, sknt, gust, vsby, skyll, skycl real number (in knots), 0/1 METAR 
seasonality is_peakhour, is_weekday O/T External Info Flight Plan/APD 
AIC info Aircraft RECAT-EU 0,1,2,3,4,5 
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(a) Trajectory Image Example, Landing from West 
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(c) Trajectory Image Example, Landing from North 


(b) Trajectory Image Example, Landing from East 


(d) Trajectory Image Example, Landing from South 


Figure 7. Examples of Generated Trajectory Images for Aircraft Landing in Four Different Directions, 7 = 60,6 = 10. 


which provides one point per second for each aircraft (missing 
data points are imputed with interpolation). For a given 7 
value, all aircraft in one image have the same flying time, so 
the relative length of the trajectory indicates the aircraft speed. 
That is, longer trajectory in one image means a higher speed 
of the aircraft. Moreover, the aircraft heading is determined 
by the direction from the last point to the first point of 
the trajectory, where the first point is usually closer to the 
runway. By generating such a trajectory image, we can capture 
several kinds of TMA context simultaneously in the image as 
described in Table. II: the aircraft position, speed, heading, 
the entry zone, relative distances. Additionally, the number of 
trajectories in the image directly represents the arrival traffic 
flow in the capture window. 

3) Runway Operation Feature: We use the ADS-B data to 
determine the landing runway and runway threshold for each 
aircraft. We also count the number of arrival aircraft for each 
runway within the tabular feature capture window [Tigo — 6, 
Tek. Furthermore, we set the runway threshold change label 
to 1 if the runway threshold is changed during the capture 
window [Tic — 5, Tic], and to 0 otherwise. 

4) External Info: We obtain the aircraft model type, e.g., 
A320, from the flight plan dataset and use it to get the 
corresponding model RECAT-EU type, e.g., Upper Medium, 
from the online APD. Then we encode the RECATEU type 


as a number from 0 to 5 (0 for light, 5 for super heavy), and 
use it as one dimension of the input feature for x . For the 
weather-related features, we utilize the wind direction (drct), 
wind speed (sknt), visibility (vsby) in miles, wind gust in 
knots, sky level 1 altitude in feet, and the sky level 1 coverage, 
which are all from the METAR dataset. We extract all these 
features, as well as the seasonality features, within the capture 
window [Tro — 9, Tercl. and each is set as one dimension of 
X°) for the ALT modeling. 


FE. Holding Featurization 


As shown in Fig. 1, we design a module to estimate the 
holding potential of an aircraft for the ALT prediction. The 
Holding Featurization module takes trajectory images and 
other tabular features as its input. In this section, we describe 
the tabular features for the Holding Featurization module. 
These include the runway arrivals that are also the input for the 
ALT Prediction module. Besides, the time gap and speed of an 
aircraft with its leading aircraft at TRC, and the variation of 
the speed compared to the average speed within TBE are used. 
As Fig. 7c illustrates, the target aircraft’s leading aircraft enters 
the holding status, which have a high probability of making the 
target aircraft also enter the holding stack. We explain these 
features in detail below. 
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Figure 8. Statistical probability of time gap at TRC, speed gap vs average 
(avg) speed in TBE, and the speed gap vs its leading aircraft’s speed in TBE. 
Blue: without holding, Orange: with holding . 


e the arrivals on runways: the number of aircraft entering 
holding stage increases when the the runways are busy. 
During this period, parallel landing can be conducted. 

e The time gap ATI4) = seen - Ta), denoting the 
the time gap between the target and its leading aircraft 
arriving at the TRC, here £(j) is the leading aircraft of 
aircraft 7 when aircraft 7 arrives at TRC; 

e the speed variation Avj*"? = uj — Up, which is the gap 
between the speed of aircraft 7 with the average speed of 
all aircraft in TBE, where vg is the average speed of all 
aircraft in TBE. 

e The speed gap Av , which representing 
the speed difference between aircraft 7 and its leading 
aircraft £(j) within the 10 NM in TBE. 

e the holding status of the leading aircraft £(7), which is 
a 0/1 label. 

The statistical probability of AT?£, Avk*"9, and 
Avpt®) are shown in Fig. 8, which is subject to the with 
holding and without holding aircraft. It indicates that holding 
aircraft tend to have smaller values in all three features. Hence, 
these features are extracted and set as the input for the holding 
featurization module. Besides, the holding status of a target 
aircraft’s leading aircraft (0/1) is set as an additional input. 
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G. Deep Convolution Networks 


1) Mobile Convolution Network for ALT Prediction: The 
MobileNetV2 has better efficiency since it utilizes depthwise 
separable convolutions, which are the key building blocks for 
many efficient neural network architectures [17]. Assume that 
the dimension of the input tensor L, in layer 7 is h; x w; Xx d;, 
the dimension for the output tensor L; is h; x w; x d;, and then 
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Figure 9. Two kinds of blocks in MobileNetV2 [17]. 
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Figure 10. Architecture of EfficientNet-BO [18]. 


the standard convolution’s kernel size is K € R***<4ix4s, 
which require the computation cost of hy - w;-d;-dj;-k-k. 
Whereas, for depthwise separable convolutions, the compu- 
tation cost is only hj + w; - dj(k? + dj), which reduces 
computation compared to standard convolution by almost a 
factor of k?. Generally, the kernel size is 3, then it means for 
the computation cost of depthwise separable convolution is 8 
to 9 times smaller than that of standard convolutions [17]. 

The main innovation of MobileNetV2 is that it is an inversed 
residual network [17], which means the residual connection is 
used only when the stride is | and the number of channels 
remains the same. Two kinds of blocks in MobineNetV2 are 
shown in Fig. 9. Depth-wise convolution uses the kernel 3 x 3, 
and point-wise convolution uses 1 convolution. Between the 
residual connections, it contains the decompression phase, 
where the output channels are expanded among connected 
layers. In this study, we use the default settings for the 
expanding ratio, i.e., 6, as in the original code [17]. 

2) Efficientnet for Holding Probability: The holding proba- 
bility is as part of inputs for the landing time prediction, which 
is captured by the Efficientnet [18] with the trajectory image as 
its input. The Efficientnet proposes a new compound scaling 
method, which uses a compound coefficient ¢ to uniformly 
scales convolution network’s width, depth, and resolution in 
the following principled way [18]: 


depth: d= a? 
width : w = B? 
resolution : r = 7° 
s.t.,a- B?.7? 22 
a>l1,pel,y>l1 


(2) 


where a,(,y are constants that can be determined by a 
small grid search. a - 8? - 7? = 2 is to make the FLOPS 
(floating point operations per second) approximately increase 
by 2%. In this study, we utilize the EfficientNet-BO, where 
the parameters are @ = la = 1.2,6 = 1.1,y = 1.15, 
and the main building block is the mobile inverted bottleneck 
MBConv. Its architecture is shown in Fig. 10. 


III. EXPERIMENTAL RESULTS AND ANALYSIS 
A. Experimental Settings 


We randomly split the total 8396 aircraft in the dataset 
D into 70%, 15%, 15% for model training, validation, and 
testing, respectively, where features are normalized first. We 
set the loss function as the L1 loss, ie., the Mean Average 


TABLE III. PREDICTION ACCURACY EVALUATION OF EACH MODEL FOR ALL SCENARIOS. 


MAPE MAE RMSE BadRatio (y = 0.3) 
Metrics d(in Mins) | 7 =60 | 7=90 | 7 =60 | 7=90 | 7=60 | 7 =90 | 7 =60 T = 90 
10 0.0738 0.0899 82.23 102.04 115.91 140.44 0.0156 0.0175 
Baseline 15 0.0789 0.0829 86.10 91.38 116.17 124.81 0.0166 0.0191 
(No holding Module) 20 0.0808 0.0799 88.11 89.28 120.87 123.76 0.0216 0.0175 
25 0.0767 0.0786 86.81 87.05 123.28 119.23 0.0183 0.0158 
30 0.0788 0.0974 87.22 101.23 120.65 130.20 0.0166 0.0308 
10 0.0390 0.0759 43.96 86.76 71.91 123.95 0.0025 0.0208 
15 0.0762 0.0766 84.86 85.58 116.12 116.89 0.0066 0.0150 
Proposed Method 20 0.0859 0.0740 94.36 84.89 126.61 121.13 0.0250 0.0125 
25 0.0763 0.0727 86.23 81.95 119.45 115.40 0.0108 0.0141 
30 0.0772 0.0786 86.05 88.64 118.97 123.98 0.0141 0.0175 


Error (MAB), the optimizer as Adam, the batch size as 64, the 
epoch as 1000, and the learning rate as 0.001. The network 
has 6.4M parameters in total. 

For the parameters of EfficientNet_BO, all parameters are 
set as in [18]. For MobileNetV2, a pretrained mode is used, 
and all parameters are as in [17]. Since MobileNetV2 [17] is 
designed for classification problems, however, our problem is 
a regression problem, hence, we use these networks for feature 
engineering and the final classification layer is replaced by fol- 
lowing sequential layers: Linear(1280, 64), BatchNorm1d(64) 
LeakyReLU(Linear(128)), Dropout(0.1). The MLPs in the 
proposed method as shown in Fig. | are structured as following 
sequential layers. (1) MLP_NI1: Linear(5, 16), LeakyReLU. 
(2) MLP_N2: Linear(48, 32), BatchNorm1d(32), LeakyReLu, 
Linear(32, 8), Dropout(0.1), Sigmoid. (3) MLP_N3: Lin- 
ear(12, 16), BatchNorm1ld(16), LeakyReLu. (4) MLP_N4: 
Linear(80, 64), BatchNorm1d(64), LeakyReLu. 


B. Evaluation Metrics 


To measure the accuracy of the ALT predictions, we adopted 
the commonly used metrics RMSE (Root Mean Square Error) 
(3), MAE (Mean Absolute Error) (4), MAPE (Mean Absolute 
Percentage Error) (5), as well as the defined BadRatio, (6), 
which are described below. 


1 nm 
MSE = ,| — ioe" 
RMS . 2, li — Gil (3) 
mar =2 3 — % (4) 
— a Yi — Yi 
1 |yi — 9: 
MAPE =~)” yi = 9) (5) 
a 
n lys—al 
"oo (eH 
BadRatio, = dit CG v (6) 
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Here, nm is the number of test samples, y; is the ground 
truth label for the test sample 7, and %; is the corresponding 
estimation. For all four metrics, lower values denote better 
estimations. RMSE is sensitive to the outliers and MAE is 
actually the L1 loss that directly represents the average estima- 
tion errors. MAPE (5) helps generally evaluate the estimations 
in terms of large ranges of ground truth values. BadRatio, is 


(0.085 


— Baseline 
—— Proposed 


0.2 0.3 0.4 
Absolute Percentage Error 


O21 


Figure 11. CDF of Absolute Percentage Error. tr = 60, 6 = 10. 


to evaluate the robustness of the model, where o() is a sign 
indicator function, ie., o(2) = 1 if the expression x is true, 
otherwise 0. + is a manually set threshold value, e.g., y = 0.3 
represents that the absolute percentage errors lvi=Usl are higher 
than 30%, i.e., the relative accuracy would be lower than 70%. 
Hereafter, the metric BadRatio, denotes the ratio of bad 
predictions over the total number of test samples, Hence, a 
lower BadRatio., presents a better robustness of the model. 


C. Result Analysis 

In the experiment study, we remove the ”Holding Featuriza- 
tion” as a baseline to testify its effect on the ALT prediction. 
Additionally, the predictions are evaluated on different settings 
of the window size parameters t = 60,90 (in seconds) 
and 6 = 10,15, 20, 25,30 (in minutes). Since the results of 
T = 30 are quite worse, the corresponding results are not 
shown. Measured by the metrics of RMSE, MAE, MAPE and 
BadRatio,, the experimental results are shown in III, where 
7 = 0.3. The predictions measured by all metrics in III show 
that both the baseline and the proposed method performs better 
with 7 = 60. Meanwhile, the proposed method performs better 
than the baseline on all metrics (r = 60,6 = 10). 

To further compare the performance of the baseline and the 
proposed method, we calculate the Absolute Percentage Error 
(APE) as in Eq. 7. 


APE = yi = Gil (7) 
Yi 


The cumulative distribution function (CDF) of the APE 
is shown in Fig. 11. We can find that the proposed ALT 


TABLE IV. PERFORMANCE IMPROVEMENT 


T = 60 ratio of bad ratio 
6 = 10 jerr| <60 seconds y=0.3 
Baseline 51.07% 1.56% 
Proposed 79.40% 0.25% 
Improvement 28.37% 1.31% 


prediction method has quite lower APE values on all cumu- 
lative percentages, and it converges to | much faster than 
the baseline. For the proposed ALT prediction method, 90% 
predictions are of lower than 85% prediction errors. 

In addition, as shown in Table. IV, the BadRatio of the 
proposed method is improved by 1.31% compared to the 
baseline with y = 0.3. For the proposed method, 79.4% 
prediction errors are lower than 60 seconds, which has an 
28.37% improvement compared to the baseline, where the 
prediction error is |err| = |y; — Y|- 


IV. CONCLUSIONS 


In this paper, we propose a novel approach for predicting 
aircraft landing time (ALT) based on deep convolution neural 
networks applied to the generated trajectory images. The tra- 
jectory images capture various information about the aircraft, 
such as its spatial position, speed, heading, entry zone, relative 
distances to other aircraft, and real-time arrival traffic flows in 
the research airspace. These images can replace most of the 
complex feature engineering, and allow us to use state-of-the- 
art CNNs for the ALT prediction problem. Moreover, since 
the aircraft holding stage can significantly affect the ALT, our 
method use a module for the automatic holding-related feature 
engineering, which takes the generated images, as well as the 
leading aircraft holding status, the time and speed gap with 
the leading aircraft at TRC, and the speed variation at TRC 
as its input. Its output is further fed into the ALT prediction, 
and finally an end-to-end ALT prediction model is proposed. 
We conduct our study on the Changi Airport and use the 
related ADS-B dataset, as well as the METAR, flight plan, 
and online APD data in our experiment. Results show that 
our method can achieve an average accuracy of 96.1%, with 
79.4% of the prediction errors being lower than 60 seconds. 
Experimental findings also verified that by integrating the 
holding featurization module with holding-related features, we 
can significantly improve the final ALT predictions. Compared 
to the baseline without the holding featurization, the proposed 
method reduces the mean absolute error (MAE) from 82.23 
seconds to 43.96 seconds, and reduces the bad ratio of the 
predictions from 1.56% to 0.25%. 

Predicting ALTs for inbound aircraft is crucial for sequenc- 
ing them efficiently. In this paper, we only predict the ALT for 
each aircraft at the research airspace boundary (TRC), which is 
SONM to Changi airport in our case study. However, predicting 
the ALT with longer distances (e.g., 10O0ONM), choice of the 
Standard Arrival Route (STAR), and actions of the air traffic 
controllers would be further beneficial for managing arrival 
aircraft. Therefore, we plan to improve our proposed ALT 
prediction method by considering these factors in the future. 
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